Search CORE

38 research outputs found

Identifying beneficial task relations for multi-task learning in deep neural networks

Author: Bingel Joachim
Søgaard Anders
Publication venue
Publication date: 01/01/2017
Field of study

Multi-task learning (MTL) in deep neural networks for NLP has recently received increasing interest due to some compelling benefits, including its potential to efficiently regularize models and to reduce the need for labeled data. While it has brought significant improvements in a number of NLP tasks, mixed results have been reported, and little is known about the conditions under which MTL leads to gains in NLP. This paper sheds light on the specific task relations that can lead to gains from MTL models over single-task setups.Comment: Accepted for publication at EACL 201

arXiv.org e-Print Archive

Crossref

Copenhagen University Research Information System

Latent Multi-task Architecture Learning

Author: Augenstein Isabelle
Bingel Joachim
Ruder Sebastian
Søgaard Anders
Publication venue
Publication date: 19/11/2018
Field of study

Multi-task learning (MTL) allows deep neural networks to learn from related tasks by sharing parameters with other networks. In practice, however, MTL involves searching an enormous space of possible parameter sharing architectures to find (a) the layers or subspaces that benefit from sharing, (b) the appropriate amount of sharing, and (c) the appropriate relative weights of the different task losses. Recent work has addressed each of the above problems in isolation. In this work we present an approach that learns a latent multi-task architecture that jointly addresses (a)--(c). We present experiments on synthetic data and data from OntoNotes 5.0, including four different tasks and seven different domains. Our extension consistently outperforms previous approaches to learning latent architectures for multi-task problems and achieves up to 15% average error reductions over common approaches to MTL.Comment: To appear in Proceedings of AAAI 201

arXiv.org e-Print Archive

Copenhagen University Research Information System

Named entity tagging a very large unbalanced corpus: training and evaluating NE classifiers

Author: Bingel Joachim
Haider Thomas
Publication venue: Reykjavik : European Language Resources Association (ELRA)
Publication date: 01/01/2014
Field of study

We describe a systematic and application-oriented approach to training and evaluating named entity recognition and classification (NERC) systems, the purpose of which is to identify an optimal system and to train an optimal model for named entity tagging DeReKo, a very large general-purpose corpus of contemporary German (Kupietz et al., 2010). DeReKo 's strong dispersion wrt. genre, register and time forces us to base our decision for a specific NERC system on an evaluation performed on a representative sample of DeReKo instead of performance figures that have been reported for the individual NERC systems when evaluated on more uniform and less diverse data. We create and manually annotate such a representative sample as evaluation data for three different NERC systems, for each of which various models are learnt on multiple training data. The proposed sampling method can be viewed as a generally applicable method for sampling evaluation data from an unbalanced target corpus for any sort of natural language processing

Copenhagen University Research Information System

Publikationsserver des Instituts für Deutsche Sprache

MPG.PuRe

Disembodied Machine Learning: On the Illusion of Objectivity in NLP

Author: Augenstein Isabelle
Bingel Joachim
Lulz Smarika
Waseem Zeerak
Publication venue
Publication date: 01/01/2020
Field of study

Machine Learning seeks to identify and encode bodies of knowledge within provided datasets. However, data encodes subjective content, which determines the possible outcomes of the models trained on it. Because such subjectivity enables marginalisation of parts of society, it is termed (social) `bias' and sought to be removed. In this paper, we contextualise this discourse of bias in the ML community against the subjective choices in the development process. Through a consideration of how choices in data and model development construct subjectivity, or biases that are represented in a model, we argue that addressing and mitigating biases is near-impossible. This is because both data and ML models are objects for which meaning is made in each step of the development pipeline, from data selection over annotation to model training and analysis. Accordingly, we find the prevalent discourse of bias limiting in its ability to address social marginalisation. We recommend to be conscientious of this, and to accept that de-biasing methods only correct for a fraction of biases.Comment: In revie

arXiv.org e-Print Archive

Copenhagen University Research Information System

Learning what to share between loosely related tasks

Author: Augenstein Isabelle
Bingel Joachim
Ruder Sebastian
Søgaard Anders
Publication venue
Publication date: 23/05/2017
Field of study

Copenhagen University Research Information System

CoAStaL at SemEval-2019 Task 3:Affect Classification in Dialogue using Attentive BiLSTMs

Author: Bingel Joachim
González Ana Valeria
Petrén Bach Hansen Victor
Søgaard Anders
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019
Field of study

Crossref

Copenhagen University Research Information System

Sequence classification with human attention

Author: Barrett Maria Jung
Bingel Joachim
Hollenstein Nora
Rei Marek
Søgaard Anders
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2018
Field of study

Copenhagen University Research Information System

Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data

Author: Barrett Maria Jung
Bingel Joachim
Keller Frank
Søgaard Anders
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2016
Field of study

Crossref

Copenhagen University Research Information System

Edinburgh Research Explorer

Data-driven sentence simplification: Survey and benchmark

Author: Abend Omri
Alva-Manchego Fernando
Alva-Manchego Fernando
Artetxe Mikel
Bach Nguyen
Bahdanau Dzmitry
Bingel Joachim
Biran Or
Bott Stefan
Bott Stefan
Bott Stefan
Carolina Scarton
Carroll John
Caseli Helena M.
Coster William
Coster William
Damay Jerwin Jan S.
De Belder Jan
Devlin Siobhan
Eom Soojeong
Feblowitz Dan
Fernando Alva-Manchego
Ganitkevitch Juri
Glavaš Goran
Gonzalez-Dios Itziar
Goodfellow Ian
Goto Isao
Guo Han
Heilman Michael
Kajiwara Tomoyuki
Kandula Sasikiran
Kauchak David
Klaper David
Klein Guillaume
Klerke Sigrid
Lin Chin Yew
Lucia Specia
Mandya Angrosh
Mikolov Tomas
Mirkin Shachar
Napoles Courtney
Narayan Shashi
Niklaus Christina
Ogden Charles Kay
Paetzold Gustavo
Paetzold Gustavo H.
Paetzold Gustavo Henrique
Papineni Kishore
Petersen Sarah E.
Post Matt
Quigley S. P.
Ranzato Marc’Aurelio
Robbins N. L.
Scarton Carolina
Scarton Carolina
Shardlow Matthew
Shewan Cynthia M.
Siddharthan Advaith
Siddharthan Advaith
Silveira Sara Botelho
Snover Matthew
Sun Hong
Vaswani Ashish
Vickrey David
Woodsend Kristian
Woodsend Kristian
Wubben Sander
Yatskar Mark
Zhang Xingxing
Zhu Zhemin
Štajner Sanja
Štajner Sanja
Štajner Sanja
Štajner Sanja
Publication venue: 'MIT Press - Journals'
Publication date: 15/09/2019
Field of study

Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and understand. In order to do so, several rewriting transformations can be performed such as replacement, reordering, and splitting. Executing these transformations while keeping sentences grammatical, preserving their main idea, and generating simpler output, is a challenging and still far from solved problem. In this article, we survey research on SS, focusing on approaches that attempt to learn how to simplify using corpora of aligned original-simplified sentence pairs in English, which is the dominant paradigm nowadays. We also include a benchmark of different approaches on common datasets so as to compare them and highlight their strengths and limitations. We expect that this survey will serve as a starting point for researchers interested in the task and help spark new ideas for future developments

Crossref

Online Research @ Cardiff

Spiral - Imperial College Digital Repository

White Rose Research Online

KoralQuery - a General Corpus Query Protocol

Author: Bingel Joachim
Diewald Nils
Publication venue: Linköping University Electronic Press, Linköpings universitet
Publication date: 01/01/2015
Field of study

The task-oriented and format-driven development of corpus query systems has led to the creation of numerous corpus query languages (QLs) that vary strongly in expressiveness and syntax. This is a severe impediment for the interoperability of corpus analysis systems, which lack a common protocol. In this paper, we present KoralQuery, a JSON-LD based general corpus query protocol, aiming to be independent of particular QLs, tasks and corpus formats. In addition to describing the system of types and operations that Koral- Query is built on, we exemplify the representation of corpus queries in the serialized format and illustrate use cases in the KorAP project

Copenhagen University Research Information System

Publikationsserver des Instituts für Deutsche Sprache